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Abstract 

New upper and lower bounds are presented on the capacity of the free- 
space optical intensity channel. This channel is characterized by inputs that 
are nonnegative (representing the transmitted optical intensity) and by outputs 
that are corrupted by additive white Gaussian noise (because in free space the 
disturbances arise from many independent sources). Due to battery and safety 
reasons the inputs are simultaneously constrained in both their average and 
peak power. For a fixed ratio of the average power to the peak power the 
difference between the upper and the lower bounds tends to zero as the average 
power tends to infinity, and the ratio of the upper and lower bounds tends to 
one as the average power tends to zero. 

The case where only an average-power constraint is imposed on the input 
is treated separately. In this case, the difference of the upper and lower bound 
tends to as the average power tends to infinity, and their ratio tends to a 
constant as the power tends to zero. 

1 Introduction 

We consider a channel model for short-range optical communication in free space 
such as the infrared communication between electronic handheld devices. We assume 
a channel model based on intensity modulation, where the input signal modulates the 
optical intensity of the emitted light. Thus, the input signal is proportional to the 
light intensity and is therefore nonnegative. We further assume that at the receiver 
a front-end photodetector measures the incident optical intensity of the incoming 
light and produces an output signal which is proportional to the detected intensity. 
We model the ambient light conditions by a Gaussian disturbance. Moreover, we 
assume that the line-of-sight component is dominant and ignore any effects due to 
multiple-path propagation like fading or inter-symbol interference0 

Optical communication is restricted not only by battery power but also, for safety 
reasons, by the maximum allowed peak power. We therefore consider simultaneously 
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two constraints: an average-power constraint E and a maximum allowed peak power 
A. The situation where only a peak-power constraint is imposed, corresponds to 
E = K. The case of only an average-power constraint is treated separately. 

The described system is called the free-space optical intensity channel and has 
previously been studied in [1], [2], [3], [1], [5]. In [3] it has been proved that the 
capacity-achieving probability measure for this channel is discrete, and in [Jj, [3] 
upper and lower bounds on this channel's capacity have been derived. Related 
channel models used to describe optical communication are the Poisson channel, see 
[6], [7], p], [2] for the discrete-time channel and [9], [lO], [II], [E], [13], [T3], [T5] 
for the continuous-time channel, and a variation of the free-space optical optical 
intensity where the noise depends on the input [2 Chapter 4], [T6] . 

In this work we present new upper and lower bounds on the capacity of the 
free-space optical intensity channel and study the capacity's asymptotic behavior 
at high and low powers. The maximum gap between the upper and lower bounds 
never exceeds 1 nat when the ratio of the average-power constraint to the peak-power 
constraint is larger than 0.03 or when only an average-power constraint is imposed. 
For the case of average-power and peak-power constraints, asymptotically when the 
available average and peak power tend to infinity with their ratio held fixed, the 
upper and lower bound coincide, i.e., their difference tends to 0. When the available 
average and peak power tend to 0, with their ratio held fixed, the ratio of the upper 
and lower bound tends to 1. For the case of only an average-power constraint the 
proposed upper and lower bound coincide asymptotically for high power, i.e., their 
difference tends to as the power tends to infinity. At low power their ratio tends 
to 2V2. 

The derivation of the upper bounds is based on a general technique introduced 
in [IZj using a dual expression of mutual information. We will not state it in its 
full generality but only in the form needed in this paper. For more details and for 
a proof see [TTl Sec. V], ^ Ch. 2]. 

Proposition 1. Assume a memoryless channel with input alphabet X = M.q and 
output alphabet 3^ = M where conditional on the input x £ X the distribution on 
the output Y is denoted by the probability measure Then, for arbitrary 

distribution R{-) over y, the channel capacity under a peak-power constraint A and 
an average-power constraint E is upper-bounded by 

C{A,E) < snpEQ[D{W{-\X)\\R{-))] , (1) 
Q 

where the supremum is taken over all probability laws Q on the input X satisfying 
Q{X > A) = and Eq [X] < E. Here, D{-\\-) stands for the relative entropy 
Ch. 2J. 

Proof See [HI Sec. V]. □ 

There are two challenges in using ([T]). The first is in finding a clever choice of the 
law R that will lead to a good upper bound. The second is in upper-bounding the 

^The proposition requires certain measurability assumptions on the law M^(-|-) which we omit 
for simplicity. However, the channel law under consideration (see its density @ ahead) satisfies 
these assumptions. See |17l Sec. V], [2] Ch. 2] for a description of the assumptions. 
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supremum on the right-hand side of ([T]). To handle this second challenge we shall 
resort to some further bounding, e.g., Jensen's inequality [iHl Ch. 2.6]. 

To derive the lower bounds we apply two different techniques: one for the high- 
power regime, and the other for the low-power regime. For high powers we use the 
entropy power inequality (see Lemma [T6|) and the theory of entropy maximizing 
distributions |18t Ch. 11]. Asymptotically, the differences of these lower bounds 
and some of the upper bounds derived using duality tend to as the power tends 
to infinity, and thus the bounds are tight at high power. At low powers we lower- 
bound capacity considering binary input distributions; a choice which was inspired 
by and [3j. In the cases involving a peak-power constraint, the asymptotic 
behavior of the corresponding mutual information is studied using [2^. When only 
an average-power constraint is imposed, a lower bound on the asymptotic behavior 
of the mutual information is derived. In the cases involving a peak-power constraint 
the asymptotic expression of the mutual information for binary inputs and some of 
the duality-based upper bounds are asymptotically tight at low power, i.e., their 
ratio tends to one as the power tends to 0. When only an average-power constraint 
is imposed, the derived lower bound on the asymptotic expression of the mutual 
information is not tight with any of the duality-based upper bounds. Indeed, the 
ratio of the best upper bound with this lower bound tends to 2\/2 when the average 
power tends to 0. 

The rest of the paper is structured as follows. After some remarks about nota- 
tion at the end of this section, we define the considered channel model in detail in 
the subsequent Section [2j Section [3] contains some mathematical preliminaries. In 
Section m we state our main results, i.e., the upper and lower bounds on channel 
capacity and the asymptotic results. The detailed derivations of the lower bounds, 
the upper bounds, and the asymptotic results can be found in Sections [5l E] and [71 
respectively. 

For random quantities we use uppercase letters and for their realizations lower- 
case letters. Scalars are typically denoted using Greek letters or lower-case Roman 
letters. A few exceptions are the following symbols: C stands for capacity, 
denotes the relative entropy between two probability measures, and /(•; •) stands for 
the mutual information. Moreover, the capitals Q, W, and R denote probability 
measures: 

• Q{-) denotes a generic probability measure on the channel input; 

• for any input x £ X, W{-\x) represents a probability measure on the channel 
output when the channel input is x; 

• R{-) denotes a generic probability measure on the channel output. 

The expression I{Q, W) stands for the mutual information between input X and 
output y of a channel with transition probability measure W when the input has 
distribution Q, i.e., 

I{Q,W)^I{X;Y). (2) 

The symbol £ denotes average power and A stands for peak power. We denote the 
mean-T/, variance-cr^ real Gaussian distribution by A/'iR(ry, o"^) . All rates specified in 
this paper are in nats per channel-use, and all logarithms are natural logarithms. 
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2 Channel Model 



2.1 Physical Description 

In free-space optical communication the input signal usually is transmitted by means 
of light emitting diodes (LED) or laser diodes (LD). Conventional and most inex- 
pensive diodes emit infrared light of wavelength between 850 and 950 nanometers. 
For such high frequencies, practical systems often apply intensity modulation where 
the transmitter modulates the optical intensity of the emitted light, and hence the 
input signal is proportional to the optical intensity. The receiver first measures the 
incident optical intensity of the incoming light by means of a front-end photodetec- 
tor and produces an output signal which is proportional to the detected intensity. 
Based on this output signal the receiver decodes the transmitted data. 

For our model we neglect the impact of fading or inter-symbol interference due to 
multiple-path propagation and assume that the direct line-of-sight path is dominant. 
In the absence of a protective medium like, e.g., a fiber cable, the dominant noise 
source is assumed to be strong ambient light. Even if optical filters are applied 
to reduce the impact of this noise, it typically has much larger power than the 
actual signal and causes high-intensity shot noise in the output signal. In a first 
approximation this shot noise can be assumed to be additive and independent of the 
signal itself. 

The maximum allowed optical peak power of the transmitted signal has to be 
constrained, e.g., to guarantee eye safety. Moreover, to increase battery life-time, 
we also constrain the allowed optical average power. Since the optical power is 
proportional to the optical intensity, and the input signal modulates the optical 
intensity, in the described system the optical power is proportional to the input 
signal. Thus, the constraints on the optical peak and average power have to be 
transformed into peak and average constraints on the input signal (and not on its 
square as usual in radio communication). For a more detailed description of the 
free-space optical intensity channel see |21j . 

2.2 Mathematical Channel Model 

We will now translate the above physical channel description into a simplified time- 
discrete channel model. The time-A; channel output is given by 

Yk = Xk + Zk. (3) 

Here, Xk denotes the time-fe channel input and the random process {Zk} models the 
additive noise. As described above this noise is mainly caused by strong ambient 
light. We therefore approximate it as a constant intensity term r] and some intensity- 
fluctuations around r/. Because these fluctuations are caused by many independent 
sources, we assume that they are independent and identically distributed (IID) zero- 
mean Gaussian with a given variance o"^ > 0. I.e., 

{Zfc}~IIDAAM(r/,a2). (4) 

Since r] is constant, we may without loss of generality neglect it because the receiver 
can always subtract or add any constant signal. We then define a new channel 
output random variable 

Yk = Xk + Zk, (5) 
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where {Z/^} ~ IID A/'k(0, cr^). Notice that the new channel outputs {Y^} represent 
the fluctuations of the electrical output signal around its working point ry. 

Our channel model is memoryless and therefore we drop the time-index k. The 
channel output Y is then given by 

Y = x + Z, (6) 

where x denotes the channel input that is proportional to the optical intensity and 
therefore cannot be negative, 

X G M+ (7) 

and where the additive noise is 

Z~AAiR(0,a2) . (8) 

Hence, the conditional probability law VF(-|a;) of the output Y given input x G M.q 
has density 

/y|x(2/k) = -=L-^e-^, XGM+, yGM. (9) 



It is important to note that, unlike the input, the output Y may be negative since 
the noise introduced at the receiver can be negative. 

The restrictions on the optical peak and average power are translated into a 
peak-power and an average-power constraint on the input, respectively: 

Pi[X > A] = 0, (10) 
E[X] < £, (11) 

for some fixed parameters £, A > 0. Note that the average-power constraint is on 
the expectation of the channel input and not on its square. We denote the ratio 
between average power and peak power by a, 

« = f, (12) 

where < a < 1. Note that for a = 1 the average-power constraint is inactive, 
in the sense that it has no influence on the capacity and is automatically satisfied. 
This means that a = 1 corresponds to the case with only a peak-power constraint. 
Similarly, a ^ 1 corresponds to a dominant average-power constraint and only a 
very weak peak-power constraint. 

We denote the capacity of the described channel with peak-power constraint A 
and average-power constraint £ by C{A,£). The capacity is given by |22J 

C{A,£) = sup I {Q,W) (13) 
Q 

where the supremum is over all laws Q on X > satisfying Q{X > A) = and 
^q[X]<£. 

When only an average-power constraint is imposed, capacity is denoted by C{£). 
It is given as in (I13p except that the supremum is taken over all laws Q on X > 
satisfying EQ[Ar] < £. 
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3 The Q- Function 

Definition 2. The Q-function is defined by 



QiO= I -^e-Vdt, V^GM. (14) 



Some of the properties of this function are recalled in the following lemma. 
Lemma 3. The Q-function satisfies 

Q(-6 + Q(0 = i, VCGM, (15) 

and 

Qio) = i (16) 



2' 



and is bounded by 



^ e-#fl--^^ <Q(0<^e-^, e>0, (17) 



and 



2 

Its first and second derivatives are given by 



QiO<le-^, e>0. (18) 



/27r 

and 



Q'(e) = -^e-#, V^GM, (19) 



Q"(0 = -^e-^, VeeM. (20) 



/27r 

Thus, Q (•) is monotonically strictly decreasing for all ^ G M, strictly concave over 
(—00,0), and strictly convex over {0,oo). 

Proof. The proof of (|15|) and p6|) follows because -^=e~~ is symmetric around 0, 
and because it equals the density of a standard Gaussian random variable and hence 
integrates to 1. For a proof of the bounds (fT7|l and (fTSjl see [23j pp. 83-84]. □ 

Lemma 4. Lei ^Oi7 ^ 6e nonnegative constants, and let the function /(•) be 
defined as 

/(0 = i-Q(eo + o-Q(eo + 7-6, eG[o,7]- (21) 

T/ien /(•) zs strictly concave over [0,7] and symmetric around £, = 2- Furthermore, 
it is increasing over [O, decreasing over [217] ; ^^^d takes on its maximum value 
ati = l. 

Proof. See Appendix El D 
Lemma 5. For fi> ^ and ^ > 0.- 

?Q(e-/^)</^- (22) 
Proof. See Appendix |Bl □ 
Lemma 6. For ^, ^ > 0.' 

1 - Q (C - /^) < 1 - Q (-/^) + -Q (-M) . (23) 

Proof. See Appendix O □ 
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4 Results 



The results of this paper are partially based on the results in [21] and [21 Ch. 3]. 

Lemma 7. Given peak-power constraint A and average-power constraint £ the ca- 
pacity C{A,£) of the free-space optical intensity channel ([6]) has a unique input 
distribution Q* that achieves the supremum in (|13p . 

Proof. See [3j. □ 



Using this lemma together with the symmetry of the channel law (© and ([5])) 
and the concavity of mutual information in the input distribution, the following 
lemma can be proved. 

Lemma 8. // the allowed average power £ is larger than half the allowed peak power 
A, then the optimal input distribution Q* in ()13p satisfies 



Eq^[X] = U. (24) 

Thus, 

C{A,aA) = c(^A,^y l<c^<'^- (25) 

Proof. See Appendix [Dl □ 



To state our results we distinguish between three different cases: 

• Case L both an average-power and a peak-power constraint are imposed, with 
a G (0,i); 

• Case IL both an average- and a peak-power constraint are imposed, with 

• Case III: only an average-power constraint is imposed. 

We present firm upper and lower bounds on the channel capacity in all three 
cases. In all three cases their gap tends to as the available power tends to infinity, 
and thus, we can derive the asymptotic capacity at high power. We also present the 
asymptotics of capacity at low power. For case I and II we state them exactly, i.e., 
we present asymptotic upper and lower bounds whose ratio tends to 1 as the power 
tends to 0. For case III we present asymptotic upper and lower bounds whose ratio 
tends to 2\/2 as the power tends to 0. 

4.1 Bounds on Channel Capacity for Case I 
Theorem 9. IfO<a<^, then C(A,aA) is lower-bounded by 

0(A.aA)>-log(^l + A^^(_^) j, (20) 
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and upper-bounded by each of the two bounds 



1, .A^ 



C(A,aA) < -log(^l + a(l-a)^J , (27) 
C(A,aA)<(l-a(^)-C^*^"""'* 



log 



A e^-e-Ki+i) • 
CT ■ ^/2^/i(l-2Q(|)) 



- - + Q - + ^ + -r^= e ^ - e ^ 

+ /.a(^l-2Q(^^^^. (28) 

Here, fi > and 5 > are free parameters, and fi* is the unique solution to 

1 e~^' 

« = - - T^^- (29) 
fi* 1 — e 

The existence and uniqueness of a solution to (f29l) is guaranteed by the following 
lemma. 

Lemma 10. Let Lp be a function from the positive reals to the open interval (O, ^) 

1 e"'' 

f ■ -- -. — ■ (30) 

/X 1 — e 

Then, ip is monotonically strictly decreasing and bijective with the following limiting 
behavior: 

lim ip{n) = 0, (31) 

limv9(^) = ^, (32) 
lim (^•93(^)) = 1. (33) 

Proof. See Appendix |El □ 
A suboptimal but useful choice for the free parameters in upper bound ()28p is 

<5 = alogfl + -V (34) 



^i = ^i*[l-e-''^j, (35) 

where /i* is the solution to ([29|) . 

Figures [T] and (21 depict the bounds of Theorem [U] for a = 0.1 and 0.4, where (j28p 
is numerically minimized over 5, fj, > 0. 

Theorem 11. If a lies in (O, ^), then 

lim jc(A,aA) -log-l = -- log 27re - (1 - a);U* - log(l - a/x*) (36) 
AToo [ a } 2 

and 

C(A,aA)^a(l-«) 

AiO AVc72 2 ^ ^ 
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Figure 1: Bounds of Theorem [U] for a = 0.1 when upper bound (|28|) is numericahy 
minimized over 5, /i > 0. The maximum gap between upper and lower bound is 0.68 
nats (for | « 10.5 dB). 
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Figure 2: Bounds of Theorem [U] for a = 0.4 with numericahy optimized upper 
bound (f28]l . The maximum gap between upper and lower bound is 0.52 nats (for 
^ PS 6.4 dB). 
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4.2 Bounds on Channel Capacity for Case II 
Theorem 12. If a ^ [l'^]' ^^^^ C(A, aA) is lower-hounded by 

C(A,aA)>llog(l + ^), (38) 

and is upper-bounded by each of the two bounds 

C(A,aA) < ilogfl + ^y (39) 



<7 J J 'a\/2;(i-2c(f)) 

where 5 > is a free parameter. 

A useful but suboptimal choice for 6 is 

5 = alog(^l + ^y (41) 

Figure El depicts the bounds of Theorem I12t where upper bound (j4U|) is numericahy 
minimized over S > 0. 



Theorem 13. If a lies in [^,l], then 

lim |c(A,aA) -log-1 = --log27re (42) 

AToo [ (T J 2 

and 

lim%i^ = l. (43) 



Note that ()42|) and (j43p exhibit the weh-known asymptotic behavior of the ca- 
pacity of a Gaussian channel under a peak-power constraint only [22j. 
Based on the right-hand sides of (j36p and ()42p we define 

,.^j-^log2TTe-{l-a)iJ,*-log{l-aii*), < a < i, 
I — ^ log27re, ^ < a < 1- 

Thus for a G (0, 1), x(q^) represents the second term in the high SNR asymptotic 
expansion of the channel capacity C(A, aA). It is depicted in Figure HI Note that 
when a tends to 0, then x(ck) tends to — oo. This can be seen by rewriting x(ct) for 
a G (0, i) using ^ as 

X(a) = -^log27re-a/i* -logj-^^— r, a G ^0, , (45) 

and then noting that by Lemma [10] (in particular by ([3T]) and (|33l) ) /x* | oo and 
a/x* t 1 when a [ 0. 
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Figure 3: Bounds on capacity for q G [^) l] according to Theorem 1 12t where upper 
bound ()40p is numerically minimized over 5 > 0. The maximum gap between upper 
and lower bound is 0.50 nats (for ^ ~ 6.4 dB). 
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Figure 4: The term x{<^) for a G (0, 1]. 



4.3 Bounds on Channel Capacity for Case III 

Theorem 14. In the absence of a peak-power constraint the channel capacity C{£) 
is lower-bounded by 

1 , / e^-e \ 



and is upper-bounded by each of the bounds 



C{£) < log (^pe-^ + V2^aQ (^^^^ - log (^2^^) - ^ 



'e 



C(£) < log + V2iaQ (^)) + Iq (^) + 

--log27^e(J^ (5>0, (48) 

where /3 > and 5 are free parameters. Bound ()47p onZy holds for 6 < —ae~^, while 
bound (148 p on/y /io/c/s /or (5 > 0. 



A suboptimal but useful choice for the free parameters in bound ([TTjl is 



= -2cj Jlog |, for ^ < e"^ w -0.4 dB, (49) 
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and for the free parameters in bound (|48p is 

6 = a\og(^l + ^y (51) 
1 / as'' 



Figure [5] depicts the bounds of Theorem [H] when the upper bounds ()17|) and (fl8]l 
are numerically minimized over the allowed values of /3 and 6. 

Theorem 15. In the case of only an average-power constraint, 

lim{c(£:)-log-| = ^log;f (53) 



and 

C{£) 



lim^^^<2, (54) 



lim^^gL>^. (55) 

Note that the asymptotic upper and lower bound at low SNR do not coincide in 
the sense that their ratio equals 2\/2 instead of 1. 



5 Derivation of the Firm Lower Bounds 

To derive the lower bounds in Section Owe use the entropy power inequality. 

Lemma 16 (Entropy Power Inequality). // X and Z are independent random 
variables with densities, then 

Proof. See [ig Theorem 17.7.3]. □ 



One can find a lower bound on capacity by dropping the maximization and 
choosing an arbitrary input distribution Q in ()13p . However, in order to get a tight 
bound, this choice of Q should yield a mutual information that is reasonably close to 
capacity. Such a choice is difficult to find and might make the evaluation of I{Q, W) 
intractable, because already for relatively "easy" distributions Q the corresponding 
mutual information is difficult to compute. We circumvent these problems by using 
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Figure 5: Bounds on capacity according to Theorem [T3] when upper bounds (j47|) 
and ()48p are numerically minimized over allowed values f3,S. The maximum gap 
between upper and lower bound is 0.57 nats (for ^ ~ 2.8 dB). 
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the entropy power inequality (j56p . For any probability distribution Q with density 

C>I(Q,T^) (57) 
= h{Y) - h{Y\X) (58) 
= h{X + Z)- h(Z) (59) 

>ilog(e2'^W+e2'^(^))-/i(Z) (60) 

I ( 2h(X) \ 

where ()60p follows from Lemma [HI To make this lower bound as tight as possible 
we will choose a distribution Q that maximizes differential entropy under the given 
constraints [lEl Ch. 12]. 

5.1 Lower Bound ( !26l) of Theorem [9] 

The input distribution Qi that maximizes differential entropy under a nonnegativity 
constraint, a peak constraint, and an average constraint has the density [l8j 

-•— ^- -e- — , 0<x<A, (62) 

where /U* has to be chosen such that the average-power constraint is satisfied, i.e., 
fi* is given as the solution to 

1 e~'^* 

a = -. (63) 

By Lemma [TO] such a solution always exists for < a < | and is unique. The bound 
(I26p now follows from ()6ip by computing /i(X) under the probability law Qi. 



5.2 Lower Bound ([3HD of Theorem [H 

The uniform distribution over [0, A] maximizes differential entropy under a nonneg- 
ativity and a peak constraint [18j . We choose Q2 to be this uniform distribution and 
note that then Eq^ [X] = ^ , and hence Q2 satisfies any average-power constraint 
larger than ^. Lower bound (I38p follows directly from (I6ip by computing h{X) 
under the law Q2- 

Notice that the uniform distribution Q2 represents the limit of the input distri- 
bution Qi in Section [5.11 when a t |. Indeed, by Lemma \TU\ and in particular by 
the limit §21, 

lim/x* = 0, (64) 

and hence when a t ^ the density of Qi converges pointwise to the uniform density 
over [0, A]. 
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5.3 Lower Bound (HSI) of Theorem 



The distribution Q3 that maximizes differential entropy under a nonnegativity con- 
straint and an expectation constraint is the exponential density 



1 - 

-e"£, x>0. (65) 

The bound ()46p follows from (I6ip by computing h{X) under the law Q^. 

Note that the density of Q3 represents the pointwise limit of the density of Qi 
when a I 0. Indeed, by Lemma [TOl and in particular by ([3TI1 and ([33l) . 



lim fj.* = 00, (66) 
lim(a/i*) = 1. (67) 

aj,0 

Also, using Q = ^ we can rewrite the density of Qi, i.e., ([62]) . as 

'^■r-^e-^^, (68) 
c I — e a 

which by ([66]) and (j67|) tends to ([65|) when a | 0. 

6 Derivation of the Firm Upper Bounds 

The derivation of the upper bounds in Section [H is based on Proposition [TJ 

C<svipEQ[D{Wi-\X)\\R{-))] (69) 
Q 

/oo 1 1 

\ogdR{y)dWiy\X) --log2W. (70) 
-00 J ^ 

Hence, we need to specify a distribution R, evaluate the integral (I70p . and finally 
upper-bound the supremum in ([70]l . To upper-bound the supremum we shall present 

which hold for arbitrary input laws 



upper bounds on Eg — log dR{y) dW{y\X) 
Q satisfying the imposed power constraints. 

6.1 Upper Bound ((271) of Theorem [9] 



To derive the first upper bound (I27p we choose an output distribution Ri corre- 
sponding to a Gaussian random variable of mean £ and of variance (cr^ + (A — 
i.e., Ri has density: 

1 

f^(y) = — e 2-^+2£(A-£) yGM. (71) 

V27r ((T2+f(A-0) ^ ^ 
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For arbitrary law Q satisfying Eq[X] < 8 and Q{X > A) = this yields 



log h{y)dW{y\X) 



\ogJ2TT[a^ +£{A-£)) +E 



Q 



< log J27r((j2+f(A-£:)) +E 



X^ + a^ - 2£X + £^' 

2a^ + 2£{A - £) 
{A - 2£)X + a"^ + £^ 



<log^27r((j2+^(A-f)) + 
= ^log27re {a^ + £{A-£)), 



2cj2 + 2£{A - £) 
{A - 2£)£ + ^2+^2 
2^2 + 2£{A - £) 



(72) 
(73) 
(74) 
(75) 



where the first inequality follows from X"^ < AX due to the peak-power constraint, 
and where the second inequality follows from the average-power constraint using 
that f = a < i, i.e., A-2£>0. Since the resulting upper bound in (|75|) does not 
depend on the input law Q, (f75|) also upper-bounds the supremum in (fTOl) . and ([2 
is proved. 

6.2 Upper Bound ([28]) of Theorem [9] 

To derive (|28]) we choose the law on the output R2 to have density: 



/2(y) = < 



e 2<T^ , 



y < -S, 



— ^rr/ye -S<y<A + 6, 



27r(T 



-e 2<t2 



y > A + 5, 



(76) 



where 5 > and > are free parameters. This leads to the following expression 



logi?^(y) dW{y\X) 



-s 



1 _(y-xf I _y_ , 

e 2<t2 log \/27r(Te2<T2 



27r(T 



A+(5 



-5 



ci 



27rcr 



-e 2<t2 log 



'e-^ /.(l-2Q(|)) ' 
A e^_e-M(i+i) 



dy 



+ E, 



°° 1 (y-xf / (^-A)- 

e 2(t2 log v27ro"e 2,72 



A+5 v2vr(T 



dy 



(77) 



We investigate each term individually. We start with ci: 

-<5 



ci = Ec 



27rcr 



1 / , y 1 

e 2<t2 log v27rc7e2^ dy 



(78) 



Lapidoth, Moscr, Wiggcr, March 10, 2009, submitted 



18 



= Er 



log (^/2^c7) • Q 
log (^^2^(7) • Q 
log f \/2^ct') • Q 



5 + X\^ 



+ 



2a2 

1 



00 V 27r(T 

5 



e 2<T^ dy 



(79) 



y2^^e ^dy (80) 



27rcr 

2"' Vo-y ' 2^/2^0- 



e 2<t2 ^ 



(81) 



where the inequality follows from the assumption 6 > that ensures that (y-x)2 > 
y2 for all X > and y < —(5. Similarly we get for C3: 



C3 = Eq 
= Eq 
+ Eq 



1 



e 2<t2 log I v27r(Te 2,72 j dy 
(5 + A-X 



A+(5 v27rcr 
log f V2^ct) • Q 



(y-A)" 



1 



^ / (y-A)2^^e ^^dy 



A+<5 



< E. 



log f ^/2^(t) • Q 



27r(T 
(5 + A-X 



cr 



1 



(y-A)^ 



A+(5 



e 2<t2 dy 



log f V2^(t) • Q 



27r(T 
(5 + A-X 
cr 



2^ 



+ 



5 



cr/ 2V27r(T 



-e 2(T^ 



(82) 

(83) 

(84) 
(85) 



Here, the inequality follows because (y — x)^ > (y — A)^ for all x < A and y > A + 5. 
Finally, for C2 we have 



C2 = Eg 

= Eq 
+ E 
+ E 



1 



A+(5 



1-Q 
l-Q 



-e 2<t2 loQ 



e A" — e 



A 



AV27r 



27rc7 
5 + X 



a 

6 + X 
a 

e 2<t2 



e A dy 



(86) 



Q 



6 + A-X 



a 



log 



e A" — e 



-M(i+i) a' 



1-2Q(|) /.^ 



6 + A-X\\ 11 ^ 
~x 



(A+S-X) 



A 



(87) 



Plugging ci, C2, and C3 into (f77]) and combining this with (f70|) we get the following 
bound: 



c<ei 



+ Eq 

+ Eq 
+ Eq 



1-Q 
1-Q 



AV27r 



(^ + a: 

cr 

(5 + a:' 

cr 

js+x)'-' 

e 2<t2 



Q 
Q 



(5 + A-X 



cr 



A ( eA* - e 



log 



27rc7M(l-2Q(|)) 
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It is shown in Appendix [F] that 



A ( 



log 



2^a/i(l-2Q(f)) 



> 



(89) 



for any values of A, cr,6, fj, > 0. Therefore we may use Jensen's inequality combined 
with the concavity shown in Lemma [H to conclude that 



-Q 



l-Q 



6 + X 



Q 



6 + A-X 



a 



log 



27raM(l-2Q(f)) 



< 1- Q 



6 + Eq[X]\ /^ + A-EQ[X]^^ / A(eA -e-^-A, 



a 



Q 



a 



log 



fj.S 



27rc7/x(l-2Q(|)) 



< l-Q 



S + aA 



a 



Q 



6 + {l-a)A 



log 



A f e A" 



27ra^(l-2Q(|)) 



(90) 
■ (91) 



Here, the last inequality follows from the average-power constraint, the assumption 
that a < ^, and from the fact shown in Lemma H] that 



l-Q 



5 + C 



a 



Q 



<5 + A-e 



a 



is monotonically increasing for < ^ < ^. 

Next, we once more use Lemma [H to upper-bound (l-Q (^^) — Q { ^~^^~^ )) 
by its maximum value that is taken on for x = 4: 



l-Q 
< E, 



d + X 



a 



Q 



6 + A-X 



a 



l-Q' 2 



A' 



X 



cr 



Q ' 2 



cr 



A 



.«fl-2Qf^^ 



(92) 
(93) 
(94) 



And finally, we use the monotonicity of the exponential function and the fact that 
X £ [0, A] to show the following: 



-Q 



AV27r 



e 2<T^ 



2^ 



< 



AV27r 



(A+sy 



e 2(T^ — e 2(T 



(95) 



Finally, combining ([88]) with ([9T]) . ([M|) . and ([95|) yields the bound on channel ca- 
pacity given in ((28]) . 
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6.3 Upper Bound fl39l) of Theorem | 

To derive bound (j39p we choose a Gaussian output law with density 



/3(y) 



(96) 



This yields 



logE!,{y)dWiy\X) 



logW2^ ( ct2 + ^ ) + Eq 



^2 + ^2 - AX + ^ 



2a2 + ^ 



9 , 



1 



- log 27reo-^ 1 + 



- 



(97) 

(98) 
(99) 



where the inequality follows because X'^ < AX due to the peak-power constraint. 
Combined with (j7U|) this yields the claimed result. 

Note that the relation Eq[X] < aA has not been used. Therefore this bound is 
valid for all a S [0, 1] and especially for all a S [^i l] • 

6.4 Upper Bound (iOl) of Theorem [12 

The derivation of this bound is similar to the derivation of ()28p . We choose an 
output distribution R4 with density: 



h{y) 



1 



'2na 

l-2g(|) 
A+25 



1 



2na 



6 2^, y < -S, 

-6<y<A + 5, 

e 2<t2 ^ y > A + 5, 



(100) 



where 5 > is a free parameter. This leads to the following expression: 



log R'MdW{y\X) 



■ f-^ 1 (y-xf ( ' 

/ e 2<t2 log I \/27rcre2^ dy 

7-00 v27r(7 V / 



A+5 



(y--y)2 



-<5 



27ro- 



e 2<t2 log 



A + 2(5 
l-2Q(! 



■dy 



1 



C2 



A+5 v27r(T 



e log I v27r<7e 2^2 j d?/ 



(101) 



C3 



Lapidoth, Moscr, Wiggcr, March 10, 2009, submitted 



21 



We have already upper-bounded ci and C3 in ([8T|) and ([85]) (assuming that 5 > 0). 
Similarly to C2, we compute C2 as follows: 

»A+(5 



C2 



1 



<5 

1-Q 



27ro- 

6 + X 



2<T^ loE 



A + 2(5 

i-2Q(| 

(5 + A-X 



■dy 



log' 



A + 2(5 



(J / \ a // -i_2Q(|) 
Plugging ci, C2, and C3 into (jlOip and combining this with ()70p we get 



(102) 
(103) 



C < Er 



1-Q 



5 + X 



5 + A-X 



a 



log 



A + 2(5 



27r^(l-2Q(|)) 



e 2<T^ 



For any A > and 6 > 



A + 2(5 



> 



2^ 



> 1, 



(104) 



(105) 



V2^a{l-2Q{§)) - V2^(l-2Q(|)) 

where the first inequality follows from dropping A and the second inequality is proven 
in Appendix W\ Equation (I273p . Hence, 

A + 2(5 



log 



27ra(l-2Q(|)) 



>0, 



(106) 



and we can use Lemma H] to upper-bound (l — Q (~^) — Q ( ^ ^ )) by its maxi 



mum value that is taken on for x = ^: 



-Q 



1-Q 



5 + X 



a 



Q 



S + A-X 



log 



A + 2(5 



< 1- Q ' ' 



(T 



log 



27ra(l-2Q(|)) 
A + 26 



2vra(l-2Q(|)) 



1-2Q' 2 



a 



log 



A + 2(5 



2vra(l-2Q(f)) 



(107) 
(108) 



Again we have not used the relation Eq[X] < qA, and hence the bound is valid for 
arbitrary a G [0, 1]. 

6.5 Upper Bound gTl) of Theorem [H 

One of the main challenges of deriving the upper bounds of Theorem 1 141 using duality 
is that without a peak-power constraint the input can be arbitrarily large (albeit 
with small probability). This makes it much harder to find bounds on expressions 
like Eq [X^] . Still, we shall derive upper bound (|Tr|) using duality. 
We choose a distribution with density 



My) = { 



— ^ „ 
-e 2^ 



/3e 2<t2 +v^crQ(|) 



y < -s, 



y+S 



(109) 



-e 2^e (3 , y > —6, 
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where 5 € M and f3 > are free parameters. This leads to the fohowing expression: 



/oo 
logR',{y)dW{y\X) 
-oo 



log ( Pe ^ + V2TraQ 

.20-2 j_g ^2Tra 
IP 



+ E, 



roo ^ 
J-S V^TT 



e 2<T^ dy 



-Q 



1 

2^ 



-s 



2 1 ^y-y 
y — e dy 



27ro- 



5 \/27ra 



(y-xy 



e dy 



= log ( /3e 37^ + V27r(TQ 

^2 



2 + 2^j2 



a 



+ T^^e 2<t2 



2V27r(7 



2^(^-2 



C4 

C7 



5 + X 

3 



l-Q 



s + x 



^3 



-e 2(T^ 



C6 

We now restrict the free parameter S to satisfy 

a 



S<- 



and continue as follows. For arbitrary input law Q such that Eq[X] < 6: 

'6 + X' 



-Q 



2^ 



2^2 



a 

5 + X 
a 



1 

^2^ 



< 



Q 

(T a 



X X 
2a ' "a 
X -5 
2^ ' V 

2^' 



-Q 



(5 - X _is+xf 
e 



2V27rcr 



(T 



<0; 
= Eq 
<Eq 



2^2 

2a2 



(7 0" 



-Q 



<0; 



S 



tQ - 



(110) 



(112) 

(113) 
(114) 
(115) 
(116) 
(117) 

(118) 
(119) 
(120) 
(121) 
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-Q 



d + X 



a 



a {s+xf 

e 2<t2 



< 



X 

J 



4^ 



27r/3 



(122) 
(123) 



Here, the first inequality (fTT3|) follows from Q (^ < 1 for all ^ G M; (fTTS]) follows 
from Lemma [S] using assumption (|112p ; in the subsequent inequality pi6|) we use 
the average-power constraint together with ()112p : ()117p follows from X > and 
5 < Q (by (|112p ): in pi9p we use Lemma El and the subsequent inequality (|12Up 
follows again from the power constraint together with ()112p : ()12ip is due to ()112p : 
()122p follows because Q(^) > for all ^ G M; and in the last inequality (jl23p we 



upper-bound e 2<t2 by 1. 

Combining (fTT3]) - (f]i3|) with (fTTTI) and ([TO]) yields the claimed result. 

6.6 Upper Bound (HHD of Theorem [H 

The bound (j48p follows from the same choice (|109p as we have used for the bound 
(I47p . However, here we will restrict the free parameter 5 to be nonnegative: 



5 > 0. 



We can then bound C4 as 



C4 



-Q 



1 



1 

2^ 
1 

2^ 
(5 



1 



■00 
-5 



2Tra 
1 



dy 



27rc7 



e 2^ dy 



+ 



2V2ira 



-e 20-^ 



(124) 

(125) 
(126) 
(127) 



where the inequality follows from the assumption 6 > and the nonnegativity of X 
that ensure that {6 + X)^ > 6^. 

Moreover, using the concavity and monotonicity of ^ ^ (1 — Q{£,)) for ^ > 
(see Lemma [3| and Jensen's inequality, we bound C5 as follows: 



C5 



6 + X 



6 + £ 



a 



and, using the nonnegativity of Q (•) and of X, we get 



C6 



-Q 



6 + X 
5 + X 



l-Q 



a 



< 



5 + £ 



a 



2TTf3 



6 + X 
a 

-e 2<t2 



+ 



e 2<t2 



27rP 



P V27rp 

Combining (fTTTI) . (fT2711 . (fT28]l . and (fT3T]l with (I70|) yields the claimed result. 



(128) 

(129) 
(130) 
(131) 
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7 Derivation of Asymptotic Results 

7.1 High-SNR Asymptotic Expression (IHUj) in Theorem 1111 

To derive (j36p we choose the free parameter 6 of Theorem [9] as in (j34p , and set the 
parameter /i equal to fi* , the solution to (I29p . Then, 



lim - = 0, (132) 

AToo A 

lim 5 = oo, (133) 

AToo 



2 



lim5e~^=0, (134) 

AToo 

and therefore from (|28p follows that for a G (O, ^): 

lim j C(A, aA) - log - I < log ^ ^ - - + ^*a. (135) 
AToo i ' a } V2^n* 2 

On the other hand, from (1261) follows 



f All 1 - e-^ 
lim <^ C(A, aA) - log - ^ > — log 27re + au* + log . (136) 

AToo [ (7 J 2 fl* 

By Equivalence (j29p and basic arithmetic reformulations it can be shown that the 
two bounds (I135p and (I136p coincide and equal the limit (f36|) . 



7.2 High-SNR Asymptotic Expression f H2|) in Theorem 

To derive (P2]) we use lower bound ([38|) and upper bound (Pop for (5 as in (|^T]) . By 
this choice of 6: 



lim Q 



A + 2(5^ 



AToo \ 2a 
and hence from upper bound ()40p follows 

lim <^ C(A,aA) - log — 
AToo I ' a 



lim — 

AToo A 


= 0, 


(137) 


lim 6 


= oo, 


(138) 


AToo 




_ 

lim 6e ^ 


= 0, 


(139) 


AToo 




log (A + 25) 


= 0, 


(140) 



1 + 2| „^/A + 25\, A + 25 



< lim <^ log ^ - 2Q — log 



AToo]^ -1_2Q(|) 2a J %^(i_Q(|)) 

1 f 5\ 5 *0 

-log2.e+Q - (141) 



-llog2»e. (142) 



Lapidoth, Moscr, Wiggcr, March 10, 2009, submitted 



25 



On the other hand, by lower bound (j38p : 

Um |c(A,aA) - log— 1 > lim - log + ^— ^ 



--log 27re. 



(143) 



The two bounds coincide and therefore they prove (I42p in Theorem 1131 



7.3 High-SNR Asymptotic Expression 



in Theorem 



To derive ()53p we use bound (j48p with the following choice of the free parameters /3 
and 5: 



for £ > a. Then, 



5 = cta/Iok — , 



(144) 
(145) 



lim 5 = oo, 

f too 



lim ^ 

f too O 

_ 

lim (5e 2^ 

£too 

62^ 



lim 

ftoo c 



0, 
0, 

0. 



(146) 
(147) 
(148) 

(149) 



Hence, we get from (148 



lim I C(<?) — log — 1 < lim < 
£'too I cr J f too 




V2^o-e^Q (^ 




5 + £: + ^e-5:; 



^ - log 27re 

1 , 1 , e 
= 1 log 27re = - log — . 

2 ^ 2 ^27r 

On the other hand, we get from lower bound (I46|) that 

li.n{c(£)-log|}>il„g|^. 
These two bounds coincide and therefore prove (j53p in Theorem 1151 



(150) 
(151) 

(152) 



7.4 Low-SNR Asymptotic Expression ( 1371) in Theorem 1111 

In order to prove the low-SNR asymptotic expression (I37p in Theorem [TT| we de- 
rive an asymptotic lower bound that combined with upper bound (j27p yields the 
desired result. The lower bound we propose is based on Theorem 2 in ^20j. For the 
channel ([6]) under consideration, the technical conditions A-F in [20] are fulfilled, 
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and Theorem 2 in pQ] states that for peak-constrained inputs |X| < A the mutual 
information satisfies 

I{X;Y) = ^-^ + o{A% (153) 
where o(A^) decreases faster to than A^, i.e., 

nm^^ = 0. (154) 

m A^ 

We restrict attention to settings where < A < 1. Then, the binary input 

X = |°' with prob. 1 - a, ^^^^^ 

lA(l — A), with prob. a, 

is nonnegative and peak-constrained, and it satisfies the average-power constraint 
E [X] < aA. Hence, by (11530 and since for the choice of X in ()155p Var {X) = 
a(l-a)A2(l-A)2, 

a(l-a2A^(l-A)2 ^ < A < 1, 0<a<i (156) 

2cj^ 2 

and we obtain the following asymptotic lower bound on the capacity for a G (O, 

C(A,aA) a(l-a) 1 , 

2 ^ 9 ' 0<a<-. (157) 

Furthermore, by upper bound ()27p and since log(l -|- ^) < ^, for ^ > 0, 

- — C(A,aA) a(l-a) 1 , , 

The low-SNR asymptotic expression (|37p is then established by the last two inequal- 
ities. 



7.5 Low-SNR Asymptotic Expression in Theorem 

To prove the low-SNR asymptotic expression (|43p we derive an asymptotic lower 
bound which combined with upper bound (|39]) yields the desired result. The lower 
bound we propose is again based on Theorem 2 in [20]. 

We choose a nonnegative and peak-limited binary input X which equiprobably 
takes on the values and A(l — A), for < A < 1. Then, we apply the same steps 
as in the previous section, and in analogy to ()156p obtain 

/(X;y) = ^^ii-2^ + o(A2), 0<A<1, (159) 

and 

, C(A,aA) 11 , , 

AlO A^/cr^ 8 2 

Furthermore, by upper bound ([5^ and from log(l -|- i^) < ^, for all > 0, 

— C7(A,aA) 11 , , 

m \' ^ < -, - < a < 1. 161) 



The low-SNR asymptotic expression (j43j) now follows by the last two inequalities. 
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7.6 Low-SNR Asymptotic Expressions flMl) in Theorem 

The asymptotic upper bound follows from upper bound dU]). We choose 6 as 
in dM]), i.e., 

-2^7^^^, (162) 



A 



and 

/3 = ^. (163) 



Then, from (gT]): 



log F^rze 2^ + Q 



C(^) ^\^fv^a" ^^Va;y 2CTyiogf-£: 



=s(-|) 



(165) 



Next we note that 



lim^log^ = oo, (166) 
limQ(-)=l, (167) 
l^^M^til^O, (168) 



£10 

and, using Q (C) < 1 and log(l + C) < ? for all ^ > 0, that 

iog(-7^ + Q(^)) logf:^ + l 



lim vv^'^^^-V < lin, ^y/^^ / (169) 



< lim ^ v"^^^ (170) 
= 0. (171) 



Together with p65p this leads to 



lim^^^^£^<2. (172) 
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7.7 Low-SNR Asymptotic Expressions flool) in Theorem 

We shall derive a new asymptotic lower bound at low powers which proves (|55p . The 
lower bound is obtained by lower-bounding the mutual information I{Q,W) for Q 
the probability measure with probability mass function 

, , f 1 - ^, if X = 0, , , 

q{x) = \ ^ (173) 

where for sufficiently small £ we choose 



xi = o-yclog-, (174) 

for some constant c > 2. Note that xi | cxd as iS | 0. In the remaining of this section 
we assume § < ^ so that the probability mass function in (I173p is well-defined. 
The probability density of the channel output Y corresponding to the input with 
probability mass function (I173p is given by 

. . X f £\ 1 -jd £ 1 (^-^1)' , , 

/y y =1 e ^ + ; e 175 

In order to evaluate the mutual information I{Q, W) for the chosen binary input 
distribution we write it as (see 



I{Q,W) = j D{W{-\x)\\W{-\0))dQix) - D{R{-)\\W{-\0)). (176) 
We can then evaluate the first term on the right-hand side as 
D{W{-\x)\\W{-\0))dQ{x) 

= —D{W{-\xi)\\W{-\0)) (177) 

Xl 



e 2ct' 

£: /■°° 1 (y-^i)' ( vx\ X? 




E ( x\ x\ 

Ex\ Jc £ f a 

^•-,/log-. (181) 



dy (178) 

dy (179) 
(180) 



2cj2 2 cjV 

Evaluating the second term is more difficult, and in fact we only derive an upper 
bound on it which exhibits the desired asymptotic behavior at low SNR. We shall 
show that 

m)|MW))<V£_l ,182) 

from which follows by ()176p and ()18ip 

^ /(QJF) ^ 1 (J 
£J0 |\/Egf 
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The desired asymptotic lower bound in (j55p then follows because ()183p holds for any 
c> 2. 

Thus, in the remaining of this section we wish to prove ()182p . To this end, we 
write 



D{R{.)\\wm) 



/y(y)log 



^1 J V2^ 



= 6 2^ + 



£ 1 



^1 



e 2<t2 



dy (184) 



V2^ 



=e 2<T^ 



/y(y)log (1 \ e^~^ ) dy 



C7 



+ 



+ 



2 c 



/y(y)log (1 \ e^"^ ) dy 

Xi Xl 



C8 

1 \ e^"^ ) dy 

Xl Xl 

^ c 



(185) 



and upper-bound cy, cs, and cg. We start with upper-bounding c-j where y > 



c? < / /y(y) fog ( 1 + —e^~~^ ] dy 

Xl Xl 



[ ' /y(y)logldy = 0, 



(186) 
(187) 



and hence, 



lim ■ 



< 0. 



£10 Sy'logil/S) 

Next we examine cg. Using ^ > and log(l + ^) < ^ for all ^ > we get 



(188) 



11 £ £ y^^JX 
log I 1 1 e 2^ 

Xl Xl 



< login e^"^ < — e^"^, (189) 

Xl J Xl 



and hence, 

C8 < 



2 c 



2 c 



jY{y)—e^ ^ dy 

Xl 



1 



£\ £ 1 



2;iy Xl V27ro-2 



e 2<T^ 



+ 



1 



XlJ V27r(T2 



2 c 



1 _(v-^ir 

e 2a2 



(190) 



(191) 
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+ — / e S^^dy (192) 



2 



- ^ii) ^^fc - th) 



where in the last equahty we used the definition of xi. We analyze the limiting 
behaviors of the two summands separately. For the first term 

V£ /i<-,rr s.\ — n 



Mm ^ T. — , = 0. 

(195) 

To deal with the second term we further upper-bound it using (fT7|) and the nonneg- 
ativity of Q (•): 



>o 

c / 3 1 n2 



< A^i - f i ) fi96) 



1 I 1 l' c I 1 

IE 



(logf)^/^ V2^(3^-VH)- 



(197) 



Note that whenever c > 2 then (f + 7) > 1 and ( ^^5^^ \/c) 7^ 0, and therefore 



lim ^ . < n (198) 



Finally, we examine the limiting behavior of cg. To this end we rewrite cg as 
Cg = / /y(y) log ^ dy+ / /y(y)log — dy 



+ /^^ /y(y) log 1 + ^ ^ ^^ e-1^+^ I dy. (199) 

V 

C9,3 
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We separately examine cg^i, 09^21 and cg^s and start with cg^i: 

/■°° // £\ 1 £ 1 _iy^ 
cg 1 = / 1 e H e 2<t2 

■ - <-»> 



2 ^ c 



<l >Oif£<i 

— a — A 



>o 



xi 0-2 ^/2^ 2o-2 



<1 

,2 



< — 5^U + cj + e 5^Vc 2J + ^ 203) 



where in the last step we used the definition of xi (I174p . Again, since c > 2 we have 
I + i > 1, and therefore 

lini^^V4= = lim^(-) = 0. (205) 

Moreover, 

hm ^y^ =lim = (206) 

and 

and we conclude that 

C9,i ^ \/c 

-^Tofyi^- 2 

Next we analyze cg^2- Note that ^ < 1, for § < ^, and hence log < 0. 
Therefore, 

C9,2= I + + llog(-l (209) 



f^--)2(? + -) + -2(--f^)') 



xi 



>Oif£<i -° 
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<£g(£i_|i)l„,(f , (210) 

xi \C(T 2a/ 



2 - V \ log c log -^-^f=W ■ (211) 



Since c > 2 the term Q (^(^"^ — y^log ^ j tends to 1 when £ tends to 0, and 
therefore 

c^^^-logf-^logc-lloglogf ^_ 1 

Finally, we analyze C93. Using that ^ — l>Oif^<^we lower-bound 
y > f + f to get 

C9,3 = fviy) log f 1 + - 1) e-'^+i^^ dy (213) 



2 ~^ c ^ 

2 



< j^^^^^ /y(y)log (1 + - 1) e"^) dy (214) 
xi) ^ (2(7 ccj) ~'~ xi ^ (co" 2(t) ) 



<i 

,2 



•log(^l+ (y-l)e"^j (215) 

<£1 

< i-e i^V2 + cj + _ . _e 2^U 2j j . / ^ j (217) 



l/p\2i2-^-; 1 /p\2U-2;^M / fj 



2 \a J 2y/clogf Vf^y \ £ 

Here, (^171) follows by (US]) and by log(l + < ^, for ^ > 

Since for c > 2 
limiting behavior: 



Since for c > 2 we have f (^ + > 1 and (^ — > 0, we obtain the following 



hm ^ ^ ^'^ : < 0. (219) 

By (I208D . ([2T2|1 . and (|2T9|) we conclude that 

— c^ ^^^_ 1 (220) 
£iofyfo^- 2 

and hence, as we have set out to prove, by combining ([188]), (|198p . and ()220p we 
obtain 

^D(fy\\fnx_=o) ^ Vc _ 1 (221) 



Lapidoth, Moscr, Wiggcr, March 10, 2009, submitted 



33 



A Proof of Lemma [4] 

We start by showing that /(•) is strictly concave. Since ^0)7 ^ 0, the functions ^ ^ 
Q (Co + and C Q (Co + 7 — ai's strictly convex over [0, 7] by Lemma [3l Then, 
also the function (Q (Co + C) + 2 (Co + 7 — 0) must be strictly convex over 

[0, 7] and we conclude that the function / : C (1 — Q (Co + C) ~ 2 (Co + 7 — 0) is 
strictly concave. 

The symmetry of /(•) around C = 2 ^e seen by noting that for all ^' G [O, ^] 
f{l+^')=^-Q{^o + l+e)-Q{^o + 7-l-e) (222) 
= 1-Q(Co + ^+C') -Q(Co + |-C') (223) 



is identical to 



/f?-C') =1-Q(Co + J-C) -2fCo + 7-? + C') (224) 



2 " / ^ V"" 2 " / V"" ' 2 

= 1-Q(Co + |-C') -Q(Co + ^+C'). (225) 

Finally, that /(•) has its maximum at ^ and is monotonically strictly increasing over 
[0, ^] follows by the symmetry and the strict concavity. 

B Proof of Lemma [5] 

For ^ < fi we have 

^Q(C-^)</", (226) 
<^l <i 

because C ^ aiid Q (") is nonnegative. 

Let us then assume that C > introduce a variable substitution y = — fi. 

Then for y > 0, we get 

CQ(C-/") = (y + /")Q(y) (227) 

<(y + ^)ie-^ (228) 

<|e-^ + f (229) 

1 1 /i , , 

- 2^'" + 2 

< /i. (231) 

Here, the first inequality ([228]) follows from ()18p : in ()229p we upper-bound e 2 

by 1; then in (j230p we replace ye 2 by its maximum e 2; and the final inequality 
(|23ip holds because we have assumed that > e~ 2 . 

C Proof of Lemma [6] 

Define 



/i(C) = l-Q(C-/^), (232) 

/2(C) = i-Q(-/^) + -Q(-/^). (233) 
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We shall show that for all ^ > 



/i(e)</2(0, 



(234) 



where /i is a nonnegative constant. Let us start with the case < ^ < /i. From 
Lemma [3] it follows that /i(-) is strictly convex over [0,fj]- Moreover, note that 
/i(0) = /2(0) and that the slope of /i(-) at ^ = is smaller than the slope of f2i-)- 



d_ 



1 



5=0 



5=0 



2 



2tt 



1 



1 

< - 

1 : 

1 



27r 
1 



27r 



(235) 
(236) 
(237) 
(238) 
(239) 
(240) 
(241) 
(242) 



^ j_ 

Here, (I238p follows from upper-bounding 2 by its maximum e 2 ; the subsequent 

inequality (j239p from the fact that -^7= < ^; and the subsequent two steps (|240p 

and (|24ip follow from Lemma EJ 

Hence for small values of /i(^) < f2{0- Since /i is convex, it can intersect 

with the linear function /2 at most one more time apart from the intersection at 

^ = 0. However, for ^ = 



= 2' 

f2{^) = 1, 



(243) 
(244) 



i.e., f2 is still larger than /i, so no intersection has taken place in the interval (0, /x]. 
For ^ > //, no intersection can take place either because by definition 



(245) 



This completes the proof. 



D Proof of Lemma [8] 

We first state an auxiliary proposition which is used to prove the lemma. 

Proposition 17. Let the random variable X take value in the interval [0, A], and 
let Z ~ A/'ir(0, o"^) be independent of X. Then, there exists a random variable X 
taking value in [0, A] and independent of Z that satisfies 

E X = ^A (246) 
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and 



Proof. Define X 



I{X;X + Z) >I{X;X + Z). 

A — X and note that 

I{X; X + Z)= I{X; k-X-Z) 
= I{X;A- X + Z) 
= I{A- X;A- X + Z) 
= I{X-X + Z), 



(247) 

(248) 
(249) 
(250) 
(251) 



where ([^15]) and 



fohow because I{U]V) = I{U; g{V)) whenever g is one-to- 



one; where ()249p fohows from the symmetry of the centered Gaussian; and where 
(|25ip follows from the definition of X. 

Let i? be a binary random variable that takes on the values and 1 equiprobably 
and independently of the pair {X,Z). Define the random variable X equal to X 
when B = 1 and equal to X when B = 0. We show that X (which takes value 
in [0,A]) satisfies both (f2l6]) and (piT]) . Condition dSMj) follows by the total law 
of expectation, by the definition of X, by the independence of B and (X,X), and 
because E[X] = A - E[X]: 



















E 


X 


= 1e 

2 


X 


B = 1 




X 


B = 



Uix] + hix] 



2 



(252) 

□ 

Condition (j247p follows because conditioning reduces differential entropy, because 
X is independent of {X,X,Z), and by ()25ip : 



liX; X + Z)=h{X + Z)- h{Z) 

> h{X + Z\B) - h{Z) 



l) + -h{X + Z\B 



hiX + Z\B 
]^{h{X + Z)-h{Z)) + \{h{X + Z) 
h{X + Z;X) + h{X + Z-X) 



0) - h{Z) 
-h{Z)) 



(253) 
(254) 

(255) 

(256) 

(257) 
(258) 



= I{X-X + Z). 
With the aid of Proposition [T7] we now prove Lemma [8) 

Proof (Proof of Lemma\B^. Let Q* denote the capacity-achieving input distribution 
(which exists by Lemma [7|). Then, by Proposition 1171 there exists an input distri- 
bution Q with average power 



1 



A 



such that 



I{Q,W) > I{Q*,W). 



(259) 



(260) 



Whenever a > ^, Q is a valid input distribution in the optimization in (fTsp and by 
(I260p it achieves capacity. But by the uniqueness of the capacity-achieving input 
distribution (Lemma [7]) the distributions Q* and Q must coincide, and therefore by 



-A. 



(261) 
□ 
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E Proof of Lemma [TO 



We start by proving that the function (/?(•) is monotonically strictly decreasing. This 
follows immediately by taking the derivative of (/?(•) 

v''(/^) = -^-(r^' (^62) 

which is strictly negative for /x G (0, oo). Then, by the strict monotonicity and 
because (/?(•) is continuous, (p{-) is bijective. 

We are left with proving the asymptotic results. Whereas the first limit ()3ip 
follows directly, the second limit ([32|) follows by rewriting the function (^(•) as 

1 - (1 + u)e-^ , , 

and then applying two times de I'Hopital's rule. Finally, the third limit (ISSp is 
obtained by observing that 

M^^) = 1 - (264) 

and that /xe^'* tends to zero as ^ tends to infinity. This concludes the proof of the 
lemma. 



F Appendix for the Proof of the Upper Bound 

It remains to show that 



A e A - e 



log ^^ : TT^ >0, V A, a, (5, ^ > 0. (265) 



We investigate the following expression: 



2^a/i (1 - 2Q (I)) - V2^afi (l - 2Q (f )) 



2^(1_2S(A)) 



2^(l-2S(^)) 



(267) 



(268) 



where the inequality follows because we drop a factor < 1. Now we note that 
since sinh(-) is a convex function over [0,oo) for any triple < < ?i < ^2 < oo, 
sinh(^i) lies below the line segment connecting sinh(^o) and sinh(^2), ^-e., 

sinh(ei) < (sinh(6) - sinh(eo)) • (269) 

42 - 4o 
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By choosing = and since sinh(O) = we conclude that 
sinh(Ci) sinh(^2) 



< 



6 



(270) 



Hence the function ^ i-^ ^'"^^^^ is monotonically increasing over (0, oo) and has its 
infimum in the limit ^ j 0, i.e., 



sinh(0>^.^sinh^^^^ Ve>0, 



where the limit follows by de I'Hopital's rule. Similarly, since the function ^ 
(1 — 2Q (,^)) is concave over [0, oo) and since 



(271) 



'"a-2Q(ei)) ^ ^(1-2Q(6)) 



> 



Hence the function 



6 - 6 

, 4^(l-2Q(«)) 



vo<ei <e2. 



(272) 



is monotonically decreasing on (0, oo) and has 

its supremum in the limit when ^ j 0, i.e., 



(273) 



2vr(l-2Q(0)^^^^^(l 2Q(0)^ 
2^ -?'iO 2e 

where again the limit follows by de I'Hopital's rule. 
Thus, for any A, a, fi,6 > 0: 



log 



A I e A — e 



27ra/i(l-2Q(|)) 



> log 



> log 




inf{|.sinh(0} 



sup| ^(yg(^n 



log - = 0. 



(274) 

(275) 
(276) 
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